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Abstract 

A single and self-contained method for determining the crystallite-size distribution 
and shape from experimental x-ray line profile data is presented. We have shown that the 
crystallite-size distribution can be determined without assuming a functional form for the 
size distribution, determining instead the size distribution with the least assumptions by 
applying the Bayesian/MaxEnt method. The Bayesian/MaxEnt method is tested using 
both simulated and experimental CeC>2 data. The results demonstrate that the proposed 
method can determine size distributions, while making the least number of assumptions. 
The comparison of the Bayesian/MaxEnt results from experimental Ce02 with TEM 
results is favorable. 1 

1 Introduction 

The analysis of x-ray line profile broadening can be considered as solving a series of 
inverse problems. There are usually two steps: — removing the instrumental contribution 
(deconvolution), and determining the broadening contribution in terms of crystallite size 
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and microstrain. Here we are concerned with quantifying only the size broadening, in 
terms of the shape and size distributions of the crystallites. We present a method that 
removes the instrumental broadening and determines the particle size distribution in a 
single step. The general theoretical framework developed makes it possible to determine 
the crystallite shape and average dimensions, and to fully quantify these results by also 
assigning uncertainties to them. 

There are two approaches that can be adopted. The first assumes functional forms 
for the size distribution and shape of the crystallite, and applies a least squares fitting to 
determine the parameters defining the size distribution (Krill & Birringer 1998, Langford 
et al. 2000). For pragmatic reasons, this approach is often used to ensure numerical 
stability; however, it is based on an explicit assumption for the crystallite size distribution 
and does not take into account the uniqueness of the solution. 

The second approach takes into account the non-uniqueness of the problem of deter- 
mining the size distribution P(D) from the experimental data, by assigning a probability 
to the solutions and enabling an average solution to be determined from the set of solu- 
tions; moreover, it also allows any a priori information and assumptions to be included 
and tested. This approach is embodied in the Bayesian and maximum entropy methods 
(Gull 1989, Skilling 1989a, Bryan 1990, Sivia 1996). Essentially, Bayesian theory tells us 
how to express and manipulate probabilities. It might be said, therefore, that Bayesian 
theory helps us to ask the appropriate questions, while the maximum entropy method 
tells how to assign values to quantities of interest. 



where k{29) defines the instrument profile and considers the imperfect optics of the diffrac- 
tometer; f(20) is the specimen profile, which (apart from strain effects which are not 
covered here) characterizes the size broadening due to microstructural properties of the 
specimen (i.e. crystallite shape, distribution and dimensions); b{29) and n(26) are the 
background level and the noise distribution, respectively. The observed profile, can 



2 X-ray line profiles 



2.1 Observed profile 



The observed line profile, g(20), can be expressed as 




(1) 
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also be expressed in terms of reciprocal-space units, s, centered about so = 2 s ™ e ° , as 

g (s)= g (26)^l (2) 

where d(20) = ^ ds. 

The problem we face is determining the size distribution and shape of the crystallites 
from given our knowledge of the instrument kernel, k{29), and our understanding 
of the counting statistics, a 2 . We also want to quantify the specimen profile and size 
distribution by assigning error bars to them. Before addressing these questions, we review 
line profile broadening from nanocrystallites. 

2.2 Crystallite-size broadening 

The line profile, I p (s, D), from a specimen consisting of crystallites of the same size and 
shape can be expressed in terms of the common-volume function (Stokes & Wilson 1942) 

I p {s, D) = 2 I V(t, D) cos2irstdt (3) 
Jo 

where I p (s, D) is the intensity profile given by the dimensions of the crystallite, D = 
{Di] i = 1,2,3}. The common-volume function of the crystallite, V(t, D), quantifies the 
volume between the crystallite and its 'ghost', shifted a distance t parallel to the diffraction 
vector. The dimension r represents the maximum length of the crystallite in the direction 
of the diffraction vector, and can be expressed in terms of the dimensions of the crystallite, 
D, such that r = r(D). The boundary conditions for the common-volume function are 
V(0, D) = Vo, where Vq is the volume of the crystallite, and V(±r, D) = 0. Fig.^shows a 
schematic of a crystallite and its ghost shifted a distance t in the direction [hkl]; the shaded 
region represents the common volume between the crystallite and its ghost. V(t, D) is 
symmetrical about the origin over the range t G [— r, t\. This implies that V(t, D) is 
an even function over this range. A simple example is a set of spherical crystallites with 
diameter D, for which the common-volume is given by (Stokes & Wilson 1942) 

V(t, D) = ^(t + 2D)(t-D) 2 (4a) 

and using Q the corresponding line-profile is (Stokes & Wilson 1942, Langford et al. 2000) 

1 D 2 cos(27rsL>) D sin(27r,sL>) 

P(S ' } " 16^4 + ^2 " l 6vr 3 s 4 ^3 ( 4b ) 
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Figure 1: The crystallite (solid line) and its 'ghost' (dashed line) shifted a distance t in the direction of the 
scattering vector [hkl]. The crystallite and ghost have dimensions, D = {Di, D2, D3}. The shaded region 
represents the common volume between the crystallite and ghost. The maximum thickness of the crystallite 
in the direction [hkl] is r. The common-volume function has the boundary conditions, 1^(0, D) = Vq and 
V(±t, D) = 0. As t — ► r, V(t, D) — * 0. The graph of this function over t represents the Fourier coefficients 
from which the area- and volume-weighted sizes can be determined. 

where r{D) = D for spherical crystallites and in the limit of s — > (|4b[) reduces to 
J p (0, D) = ttD 4 /8. 

Essentially, © is the Fourier transform of the V(t, D), and noting V(t, D) is an even 
function, the odd (sine) terms in the Fourier transform vanish. This also implies that 
the size-broadened profiles will always be symmetrical about the Bragg angle, 26q. From 
(J2J) and Fig. ^ it is clear that information concerning the dimensions and shape of the 
crystallite is given in V(t, D). 

2.3 Particle-size distribution, P(D) 

A powder specimen would not normally consist of crystallites all having the same size, 
but it can be assumed that the crystallites can have the same shape, based on kinetics 
arguments. The effect of the particle-size distribution on the common volume is to 'blur' 
the broadening effects of a single crystallite. 

The size-broadened line profile from a distribution of crystallites, P(D)PD, with 
dimensions in the range D to D + DD can be expressed as 

POO 

f(s) = 2 V(t) cos2vrstdt, VsG [-co, +00] (5) 
Jo 
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where V(t) is the modified common-volume function due to the influence of the particle- 
size distribution, 

/oo 
V(t, D)P(D)£>D. (6) 

In © a generalized measure, PD, has been used which is dependent on the crystallite 
shape and coordinate system. The area- weighted size, (t) a , volume- weighted size,(t) v , and 
column- length distribution (or area- weighted size distribution), p a (t), can be determined 
from © (Wilson 1968, Wilson 1971). It can be seen from (jSJ) how the shape and distribu- 
tion of the crystallites influence the area- and volume- weighted quantities. Substituting 
© into ©, we have 



/(') 



f'oo r roc 

2 / V(t, D)P(D)DD 

Jo Ut 

roo r pt 

\ 2 V(t, D)cos2vrstdt 
Jo l Jo 



cos2irstdt (7a) 
P(D)£>D (7b) 



where in going from (|7aj) to (|7b|) the order of integration has been changed and t is 
integrated out. In addition we note that V(t, D) > for t G [0, r] and y(t, D) = for 
t > t. Inside the brackets of ()7b|) . we have i p (s, D) from ©. Hence, (|7b|) can be written 
as 

/•oo 

f{s)= I p {s, D)P(D)PD, Vse[-oo, +oo] (8) 

where we define the profile kernel, I p (s, D), as the size-broadened line profile given by a 
single crystallite with dimensions D. In (JBJ, we notice that the effect of P(D) is to weight 
the superposition of size profiles over the range of D to D + DD. 

2.4 Determining P(D) from g(s) 

In analysing the size distribution, we want to ensure that the statistics of the observed 
profile can be carried directly into quantifying the size distribution. Equation (jSJ) expresses 
the specimen profile, /(s), in terms of the particle-size distribution and the shape of 
the nanocrystallites, while after transformation into s-space, expresses the observed 
profile in terms of f(s). Combining these two equations, the experimental data, g(s), can 
be expressed in terms of the particle-size distribution, P(D) as 
-+oo r+oc 



r+oo /"i-oo 

g(s) = / k(s - s') I p (s' , V)P(D)ds'Vn + b(s) + n(s) (9a) 

Jo J-oo 
r+oo 

= / K(s, D)P(D)PD + 6(s) + n(s) (9b) 
Jo 
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where the scattering kernel, K(s, D), 'rolls up' the instrumental effects and the profile 
kernel, given by 



In ()1U|) . the dummy variable s 1 is being integrated out. The results given by ()9b|) and (|l(Jj) 
enable the particle-size distribution to be extracted directly from the experimental data. 
This ensures that the statistics of the experimental data are transferred to quantifying 
the uncertainty in the solution. This approach also addresses a difficulty of the two-fold 
approach discussed by Armstrong & Kalceff (1999). 



In © we have a single expression for the observed profile in terms of the crystallite size 
distribution and shape, background level, and statistics of the experiment; information 
concerning the crystallite properties has been incorporated. 

In seeking to determine -P(D) from g(s), the issue of uniqueness for -P(D) becomes 
important, for two reasons: firstly, because of the 'conditioning' of the kernels, particularly 
K(s, D); and secondly, due to the presence of statistical noise, a. 

Generally, K(s, D) will be ill-conditioned. This can be demonstrated in a numerical 
calculation by expressing K(s, D) as a matrix, K; we can show det K T K ~ 0. This 
implies that the column vectors of K are (nearly all) linearly dependent, which has dire 
consequences, as any attempt to determine -P(D) (given g(s), K(s, D), a and b(s)), pro- 
duces a set of solutions {P(D)} rather than a unique solution. The presence of statistical 
noise in the data simply worsens the situation, in that the ill-conditioning of K(s, D) 
amplifies the noise and the solution is swamped by spurious and unphysical oscillations 
(Armstrong & Kalceff 1998). Faced with this situation, the following question arises: 

How do we develop a method to extract a unique P(D) from g(s), given our knowledge 
ofK(s, D), b(s) and a 2 ? 

3.2 Some observations 

Before proceeding with developing a 'method' to determine the crystallite size and shape 
from the observed data, g(s), some observations concerning these distributions need to be 




(10) 



3 Bayesian & maximum entropy methods 



3.1 The uniqueness of P(D) 
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made. 

The integral equations given by Q and Q refer to a set of continuous functions. 
However, the recording of the observed and instrument profiles is made in discrete time 
intervals. To convey this, we express the observed profile, specimen profile and size distri- 
bution as vectors, such that g = {gf, i = 1, 2, 3, . . . , M}, f = {fj>; f = 1, 2, 3, ... , N'} 
and P = {Pj', j = 1, 2, 3, . . . , N}. The scattering kernel K(s, D) can be expressed 
as matrix, K = {Kij\ V i&j} by taking the product of the instrument kernel and 
the line profile kernel. The instrument kernel can be evaluated in 2#-space, such that 
R = {k(26i - 2%,); i = 1, 2, 3, . . . , Mkf = 1, 2, 3, . . . , N'}, and using d(20) = ^ ds 
can be mapped into s-space. Similarly, the profile kernel can be evaluated over s and D, 
such that I p = {Ipj'j] j' = 1, 2, 3, . . . , N' k,j = 1, 2, 3, ... , N}. The matrix product 
gives K = Rip and is an [M x N] matrix, such that N < N' < M. 

There are two fundamental properties which g(29), f(29), P(D) and V(t, D) all share. 
The first is that these distributions are positive definite; that is, the observed profile g(29) 
and specimen function f{29) represent intensities which are positive values. The second 
property is that these distributions are additive] that is, the sum of the distributions 
over a region represents a physically meaningful quantity (Sivia 1996). For example, the 
integrated intensity of g(s) can be related back to the structure factor of the lattice, while 
the integrals J f(s) ds and j V(t, D) dt are inversely proportional to the integral breadth 
and quantify the specimen broadening in terms of size and strain contributions. The 
integral for P(D) is a special case, in that it must be unity. This ensures that we can 
attribute a probability for a particular D and detemine its moments. 

These two observations are important in formulating a 'method' that can determine 
both the specimen profile from the observed x-ray diffraction profile and an underlying 
distribution such as the size distribution, P(D), while dealing with the issue of uniqueness. 
That is, we expect our method to extract this information from the observed data and 
produce results which preserve the positivity and additivity of the profile or distribution. 
It should also be possible to incorporate the properties of positivity and additivity without 
making additional assumptions about, say, the functional/analytical form of the specimen 
profile or size distribution. These conditions ensure that the specimen profile or size 
distribution determined from the observed profile can be interpreted in general terms. 

In order to assign values to these distributions and preserve their additivity and pos- 
itivity, a suitable function must be selected. Based on these observations and various 
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arguments, the entropy function and maximum entropy principle are found to be the only 
consistent approach to inferring discrete probabilities (see Shore & Johnson 1980, Johnson 
& Shore 1983, Tikochinsky et al. 1984, Gull & Skilling 1984, Skilling 19896, Skilling 1990, 
Goambo & Gassiat 1997). 



In analyzing size-broadened profiles, the central aim is to quantify the shape and size 
distribution of the crystallites, given the experimental data. Bayesian theory is well 
suited for testing a hypothesis in the presence of experimental data. This is achieved by 
quantifying the a posteriori probability distribution for P, conditional on the experimental 
data and statistical noise. The formulation of Bayes' theorem is general and can also be 
applied to determining f . 

Using Bayes' theorem, the a posteriori probability for P is given by 



This is conditional on everything after '|', viz. the observed profile g, an a priori model m, 
the scattering kernel K, statistical noise a, a constant a, and any additional background 
information concerning the experiment, I. 

On the right-hand side of Ijllj) there are several terms that require further discussion. 

The likelihood probability distribution Pr(g| P, K, a, I) defines the probability of 
measuring g, given a size distribution P, profile kernel K, and statistical noise a. That is, 
we include our hypothesis P, and determine how probable it is to measure g, given this 
hypothesis, K and a. The likelihood function is approximated as a Gaussian distribution 
for large counts (>> 10) by applying the central limit theorem, 



3.3 Bayes 



' theorem for P(D) 



Pr(P|g, m,K, a,a,l) 



Pr(P| m, a, 1) Pr(g| P, K, a, 1) 
Pr(g| m, K, a, 1) 




Pr(g| P, K, a, 1) 



1 




(12a) 



Zl(o-) 



where 




(12b) 



and 



M 




(12c) 



i=l 




(12d) 



Armstrong et al. (zUUl J. naycsian inference 01 nanoparticle- broadened x-ray line profiles 



y 



such that {V27T a 2 } is an [M x M] diagonal matrix. 

The variance is defined in terms of the observed counts and estimated background-level 
as af = gi + bf st . In Q12a|) . the kernel, K has been included as it contains information 
about the shape of the crystallites and will influence the solution. We notice from (|12b[) 
that the matrix form of © has been incorporated. 

The term Pr(P| m, a, X) defines how probable is our hypothesis P, given it is a pos- 
itive and additive distribution and conditional on an a priori model, m. The a priori 
probability distribution can be expressed as 

Pr(P| m, a, I) = — exp [a 5(P, m)] . (13a) 
Z s {a) 

The entropy function is given as (Skilling 1989a), 

N 

S(P, m) = ^ p j ~ m, - Pj In (Pj/rrij) . (13b) 

3=1 

where the normalization term, Z$ (a) is given as 

Z s (a) = J VP exp [a S(P, m)] (13c) 

d3d) 

a J 



(2k) 



v 7 det al 



(13e) 



i 



and the integration in (|13c|) involves the measure T>P = Y\^j=i Pj 5 dPj. 

The log term in ()13b|) ensures that positive and additive distributions are obtained 
and that P will have these fundamental characteristics. The a priori model, m, defines 
our ignorance/knowledge about P. That is, if we are unsure of the shape of P, it is 
best to admit our ignorance by assigning a uniform distribution over a specified range. 
The a priori model may also include data gathered from other sources, such as electron 
microscopy (e.g. TEM, SEM and SPTM) techniques. It may also include theoretical 
or analytical models. For example, recently in the literature (see Krill & Birringer 1998, 
Langford et al. 2000, Ungar et al. 2001) there has been a widespread use of the log-normal 
distribution for P. However, in the Bayesian formulation we do not explicitly define P 
as a log- normal distribution, but set the a priori model as a log- normal distribution and 
test it in the presence of the observed data. 

5'(P, m) is essentially a measure for P relative to m. Suppose the model m was 
found to be a log-normal distribution and its parameters determined using least squares 
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analysis. If the resulting P lies 'close' to m, the change in S will be small and there 
will be little difference between P and m; also, this would imply that the underlying 
crystallite-size distribution in the specimen is a log-normal distribution with values similar 
to those determined for m, since this assumption has been tested in the presences of the 
experimental data. On the other hand, if P lies 'some distance' from m, the change in S 
will be large; this would result in a considerable difference between m and P and would 
imply that the underlying size distribution is not a log-normal distribution with the values 
estimated for m. 

The denominator term in 1)1 ljl has an important application in selecting between var- 
ious kernels, K, for different crystallite shapes. It is called the evidence (Gull 1989), 

Pr(g| m, K, a, T) = J VP J da Pr(P, g, a\ m, K, a, J). (14) 

Including all the necessary terms, the a posteriori probability distribution for P can 
be expressed as 

1 e Q 

Pr(P| g, m, K, a, a, T) = -— — — — — — - — (15) 

Z s (a) Z L (a) Pr(g| m, K, a, 1) 

where Q = a S — \ L. For convenience, Q = Q(P, a), since P and a are the only 
two unknown terms. The a term in Q(P, a) can be interpreted as an undetermined 
Lagrangian multiplier. 

Determining the most probable size distribution, P, depends on maximizing 1)15(1 . 
which in turn requires determining the global minimum for Q(P)- There are several 
algorithms for determining P from Q(P), given its nonlinear characteristics (see Skilling 
& Bryan 1984, Bryan 1990). 

The approach we follow in determining the crystallite-size distribution is similar to 
that outlined by Bryan (1990) and Jarrell & Gubernatis (1996). We start with a large a 
value and step towards a ~ 0. For a given a, we determine P such that X7Q = 0. After 
stepping through a range of a values, a set of solutions, {P(q)}, is formed parameterized 
by a. The average distribution, (P), can be determined from the set of solutions {P(a)}, 

(P) = / doP(a) Pr(a| g, m, K, a, J). (16) 

where Pr(a| g, m, K, a, I) is normalized to unity for a G [a m j„, a max ]. In the application 
of the Bayesian/MaxEnt method, the selected range was defined by a £ [10~ 2 , 10 5 ]. The 
average particle size distribution can be used to determine the average specimen profile, 
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<f>, 

daI p P(a) Pr(a| g, m, K, a, 1) 

--min 

= I P (P)- 

3.4 Determining Pr(a|g, m, K, a, X) 

The a parameter in ()15|) is important in coupling the entropy function S(P, m) with 
the likelihood function L(P). It is also a 'nuisance parameter' and its influence can be 
integrated out. In evaluating (fl6|) . it is necessary to determine Pr(a| g, m, K, a, 1); we 
do this by integrating out the influence of P, 

Pr(a| g, m, K, a, 2) = J VP Pr(P, a\ g, m, K, a, I) 

Pr(a| 1) Pr(P| m, a, 1) Pr(g| P, K, a, 1) 



VP 



Pr(g| m, K, a, 1) 
Pr(a|J) 1 



Pr(g|m, K, a, I) Z s (a) Z L (a) 



x 



JvPe Q{P ' a \ (17a) 



and expanding Q(P, a) w Q(P, a) + |(P - P) T VVQ(P - P) about P for a given a. We 
note VQ = for P = P for a given a. On integrating, we have 

Pr(a| J) 1 



Pr(a| g, m, K, a, 1) 



Pr(g|m, K, a, X) Z s (a) Z L (a) 

( 27r )Afg/2 e Q(P,a) 

v / detVV(5(a) 

Pr(a|J) 1 



(17b) 



Pr(g|m, K, a, 1) Z L (a) 
1 <lr ' nI (17c) 



det(al + A) 



where VVQ(P, a) = VVQ(a) and A are the eigenvalues of (-WS 1 )^ VVL (-WS^a = 
{P2 } K T {o"~ 2 }K {Pz }. The quantities in parentheses represent diagonal matrices. In 
p7ajl . we have introduced the a priori distribution for a, Pr(o|T). Generally, we set 
Pr(a|X) as a uniform model over a range [a m i n , a max ]. Using (|T7|) we can evaluate (|TB|) . 
In practice, we determine lnPr(a| g, m, K, a, Z) and A for each P and a in the range of 
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3.5 Resolving overlapped profiles 

The formalism presented here enables single and overlapped profiles, and even whole 
patterns to be analyzed, provided that crystallite-size effects are the major broadening 
component. Line profiles are generally overlapped due to low unit cell symmetry. However, 
specimen broadening, such as size broadening from crystallites, can also cause profiles 
to be overlapped. In this case, the underlying invariant quantity is the crystallite-size 
distribution, P. The above integral equations for overlapped peaks can be expressed in 
terms of P. The general form of Q does not change; the term that does change is the 
kernel, K(s, D), 



where s' 0q = 2sin#og/ A and Ooq is the Bragg angle at the gth peak in the pattern. The 
k(s—s'; s' ) term expresses the instrument kernel at each peak position, 9o q . The I p (s' , D) 
term is invariant over the range of s. In terms of the Bayesian analysis presented above, 
nothing else changes. 

3.6 Error analysis 

Determining the errors in P over regions of importance is a final test for the quality of P. 
The error bars for P are dependent on the choice of the a priori model and the quality 
of the observed data, a. 

It is only possible to assign error bars over a defined region, because the errors between 
points are strongly correlated (Skirling 1989a, Sivia 1996). The region of interest may 
consist of features in the specimen profile or size distribution which may not be physical, 
such as ripples in the tails of the distribution or a second peak suggesting a bimodal 
distribution. Over the defined region, we are interested in the average integrated flux 



(s, D) = / £ - s'; s' 0g ) I p (s>, D) ds' 



r+oo 



K 



(18) 



(Skilling 1989 a), 



N 



N 



P 




(19) 



3=1 3=1 



rp rp 

P W/W W 



(20) 



where w is a 'window function' defined as 




(21) 
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and the region of interest is denned by rr'. Expanding Pr(P| g, m, K, a, a, X) about 
P, we have Pr(P| g, m, K, a, a, X) oc e2( P-P ) vv<2(P-P)_ This is a Gaussian centered 
about P. By inspection, the covariance matrix for P is given by -(VVQ)" 1 , where the 
elements in — (VVQ) -1 are strongly correlated with neighboring elements. Following the 
suggestion of Skirling (1989a) the variance for P is 

0-2 = w r [_(VV<5) _1 ] w/w T w. (22) 

Hence, we can assign error bars over a region of interest to the integrated flux of P. 

3.7 Fuzzy pixel approach for determining f 

It is often important to assess the specimen broadening by determining f , without making 
any assumptions concerning its functional form. This can be performed by deconvolving 
However, in determining f 'ringing effects' can appear in the solution. The ringing 
is often due to noise effects which are amplified and appear as unphysical oscillations in 
the solution (for example see Fig. 6 in Armstrong & Kalceff 1999). The above theory 
assumes that smoothing is applied globally. However, the ringing effects are local artifacts. 
In order to introduce 'local' smoothing, we must address how to decompose f . Explicit 
in the composition of f is that it is expressed as a superposition of delta functions, 

N 

f(2O) = '£ t 6(20 - 29i)ai (23) 
i=i 

where a = {a\, 02, • • • , a^} is the set of coefficients that define the amplitude of / at the 
Ith. position. However, (|23|) assumes a global smoothness, while the ringing effects are 
local effects. 

Following the suggestion of Sivia (1999, 1996), we blur 5(29) by including the spatial 
correlation length or width. To do this, we choose a basis function which includes a spatial 
correlation length as its width and reduces to 5(29) in the limit of the width going to zero. 
That is, we make the pixel at the Ith position of f fuzzy. A simple choice is to express f 
in terms of a sum of Gaussian function, 

N 



f(29) = exp 



(29 - 29jY 
2co 2 



ai (24) 

=1 L J 

where to is the width of the spatial correlation or fuzzy pixel. In the limit of u — > 0, ()24j) 
reduces to ((2*3*|) . 
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In matrix notation (|24|) becomes, 

f = Fa (25) 

where F is an [N x N] matrix containing the elements of the Gaussian function. 

How do we determine the optimum to given the observed data, kernel and statistical 
noise? 

The tools for addressing this question have been presented. That is, we employ Bayes' 
theorem to determine the a posteriori probability distribution for u conditional on the 
observed line profile. The u which maximises the resulting a posteriori probability dis- 
tribution becomes the optimum fuzzy pixel width, to. At a practical level, we replace the 
equations where P appears with a, and the kernel K is replaced by 

G = RF (26) 

where G = G(to). 

Applying Bayesian theory, the distribution for to can be determined by integrating out 
a and a, 

Pr(u\ g, m, a, 2) = j Vs. j daPr(a, a, uj\ g, m, a, 2) (27a) 

= J Va J da Pr(a| 2) Pr(w| 2) 

x Pr(a| g, m, a, a, uj, 2). (27b) 

Following the same steps as in p7|). we have 

Pr(a|T) Pr(w|X) 1 



Pr(u\ g, m, cr, 2) 



Pr(g| m, cr, 2) Z s {a)Z L {a) 
(27t)t e Q( a ^) 



(27c) 



y/det VVQ(a, u) 

where Q(a, a, to) = a S(a)—L(a, to) for the unknown terms a, a and to; and VVQ(a, to) = 
V a V a Q(a, to). 

Error bars can also be attributed to a and f . Using the results discussed in § 13.61 
the covariance matrix for a, V a V a Q can be determined. The corresponding covariance 
matrix for f can be determined from VfVfQ = F V a V a <5F T . On applying 1)22(1 the error 
bars for f can be determined. 

Traditionally this problem has been solved by applying classical techniques, such as 
the Stokes (1948) method. In order to overcome the numerical instability of the Stokes 
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method, methods such direct convolution (Howard & Snyder 1989, Howard & Preston 
1989) and profile fitting methods, such as the Voigt function (Wu et al. 1998, Balzar 
& Ledbetter 1993, Langford 1992, de Keijser et al. 1983, de Keijser et al. 1982) have 
been developed. These approaches assume an analytical profile function for the specimen 
profile; the convolution product between the instrument and specimen profile is refined, 
by updating the parameters that define the specimen profile, until the error between the 
calculated and observed data is minimized. These methods are a means to an end. There 
is often no physical basis for choosing a particular profile function, except that it results 
in a minimized error (Armstrong 1999). However, the Bayesian/fuzzy pixel/MaxEnt 
approach determines the maximally uncommitted solution or the solution with the least 
assumptions (Wu 1997), given all the available data and information. 



4 Generating & analyzing simulated CeC>2 data 
4.1 Generating the simulated data 

Particle-size distribution, P(D). In order to test the Bayesian/MaxEnt method, 
simulated data for the 200 and 400 line profiles from Ce02 were generated. The crystallites 
were assumed to be spherical in shape with a log-normal crystallite-size distribution, 



P(D) = 1 = exp 

V27r D 2 In 2 cj 



1 ( HP /Dp) 

2 V In erg 



(28a) 



where -Do is the median and <7q is the log-normal variance. The average diameter, (D), 
and variance, o~ 2 D ^ , of the distribution are related to these quantities by 

(D) = D e ln2<J( '/ 2 (28b) 

and 

a 2 D) = D 2 Q e ln2ao (e ln2(T0 - l) . (28c) 



The log-normal parameters used were Dq = 13.03 nm and <r 2 = 2.89. Using (|28b|) 
and (|28cj) . the average diameter and variance were determined to be, {D) = 15.00 nm 
and o~ 2 D ^ = 73.17 nm 2 , respectively. Using the results from Krill & Birringer (1998)(see 
Equations 6-8, p625), the corresponding area- and volume- weighted sizes were determined. 
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The area- and volume- weighted diameters for spheres are related to the sizes (Langford 
et al. 2000) by 

(29a) 



and 



(D)a = -(t) a 



(D) v = -(t) v . 



(29b) 



The area- and volume- weighted sizes, (t) a and {t) v , can be determined from the specimen 
profile, /, and Fourier coefficients, A(t), by using (Warren 1969) 

dA(t) 



dt 



(30) 



The volume-weighted size is inversely related to the integral breadth and can be deter- 
mined either directly from the specimen profile, /, or from its Fourier coefficients, A(t), 



(3 = / f(s)ds/f max (31a) 



oc 



2 / A(t)dt 



-i 



(31b) 



= (t);\ (31c) 

where j3 is in reciprocal space units. 

Using 1)30(1 and (|29|> . the area- weighted size and diameter were determined as {t) a = 
17.56 nm and {D) a = 26.34 nm, respectively. Using ()31a() and ()29() . the volume-weighted 
size and diameter were determined as {t) v = 26.18 nm and (D) v = 34.91 nm, respec- 
tively. These settings are considered as the theoretical values for the simulated data. 
The Bayesian/fuzzy pixel/MaxEnt results are compared with the theoretical sizes and 
percentage differences are determined. 

Line profiles, f(26) &; k(29). Using the parameters for the size distribution, the 
specimen profile for spherical crystallite, f(26), was modelled over the range (28q± 10) °29 
at a step size of 0.01 °26 (see (|EJ))- The simulation of the specimen profile over this 
range minimized any artifacts in the Fourier coefficients. The instrument profile, k(29), 
was modelled on the diffractometer parameters and LaBg line-position standard reference 
material, as discussed in § The split-Pearson VII function for the 200 line consisted of 
the following parameters: FWHM ioMJ = 0.030 °20, FWRM high = 0.027 °26 and m exp j ow = 
6.928, 

i^exp^igh — 11.324, where ni eX p are the split-Pearson exponents. The 'low' and 
'high' subscripts are with respect to the Bragg positions, 26q (see Fig. EJ). 
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Generating g{26). The observed line profiles, g(29), for the 200 and 400 lines con- 
sisted of the convolution of the specimen line profile, f{28), with the instrument line 
profile, k(20), Poisson noise, and a linear background level, b(29). Statistical noise was 
also imparted onto the background before adding it to the convoluted product. This is 
expressed by (0). 

The generation of g{29) was carried out over 29q ± 10°2# in order to minimize any 

truncation errors. The maximum peak height for the 200 line profile was set to 6500 counts 

(without background level and noise or a total of 7835 counts including background level 

and noise) and the peak-to-background ratio, R p b, was set to 6.0. The corresponding 

percentage error in the peak maximum was determined using 

i 

Rpb{Rpb + 1) 



1 

@peak 



(Rpb - 1) 



In 



x 100% (32) 



where I ma x,bg is the maximum number of counts, including background level. Simulated 
g(29) for the 200 and 400 line profiles are shown in Fig. |2j The uncertainty for the 200 
line was 1.5% in the peak height. Similarly, for the 400 line the maximum peak height 
was set to 1500 counts (2646 counts including background level and noise), the average 
peak-to-background ratio was set to 2.4; and the estimated statistical uncertainty in the 
peak height was found to be 4.0%. 

In order to simulate realistic conditions, the Bayesian/MaxEnt analysis of the g(29) 
was carried out in a truncated region (29q ± 2) °29 for the 200 and (29q ± 1.5) °29 for the 
400 line profiles. In the analysis, the background level was assumed to be unknown and 
was approximated by a linear function over this region. This was achieved by examining 
the Fourier coefficients of g(29) as the level was raised/lowered until distortions (i.e. 
'hook effect' etc.) were removed. Fig. |2] shows the simulated g(29) before and after the 
background level estimation for the 200 and 400 line profiles. 

Generating the kernels, R, I p & K The numerical evaluation of the instrument 
kernel R, line profile kernel I p , and scattering kernel K, are an important aspect in the 
application of the Bayesian/MaxEnt method. The evaluation of the fuzzy pixel kernel, F, 
is also important in the implementation of the fuzzy pixel/MaxEnt method in determining 
the specimen profile, /. This section expands on § 13.21 

The advantage of the Bryan algorithm (Bryan 1990) and the Bayesian/MaxEnt algo- 
rithm is that the search direction (or subspace) is defined by the singular value decompo- 
sition (SVD) of the scattering kernel, K. This approach is numerically efficient (in that 
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Figure 2: Simulated observed 200 and 400 Ce0 2 profiles, g{29). (a) The 200 profile (solid line) and estimated 
background level (dashed line) over (29o ± 2) °29, the range over which the analysis was carried out. (b) 
Logarithm of the 200 profile before (solid line) and after (dashed line) the background estimation, (c & d) Plots 
corresponding to (a) and (b), respectively, for the 400 profile over (29q ± 1.5) °20 



it reduces the number of floating point operations) and also numerically stable, since it 
does not utilize the full column-space of the kernels. As was pointed out in § 13.21 the 
vector-space spanned by the column vectors of K may be all (or nearly all) linearly de- 
pendent, causing it to be ill-conditioned. The ill-conditioned characteristics are overcome 
by the SVD of K, VSU r , where the 'singular space' spanned by the column vectors of 
U is used to define the subspace in which the size distribution can be determined. 

The instrument kernel, R, is an [M x N'] matrix. The elements of this matrix can 
be determined by Riy = k{29i — 29 y), where M > N' . This matrix can be mapped into 
reciprocal-space, s, by multiplying each column of R by d(29)/ds = A/ cos9j/. 

The line profile kernel expresses (JHJ) as an [N' xN] matrix, I p = [I p j>j], consisting of the 

line profile from a specific common volume (i.e. shape) function. The formalism presented 

here is completely general and any shape function can be used where appropriate. In this 

study, we have employed the common- volume function for spherical crystallites (seeEJ), 
f 

(33) 



P3 3 



+ ^ 

167rV4 

3 3 

8 



cos(27rs'.,Dj) 

167r3s'4 
3 



3 



for s'y = 



where the second term in (|33j) ensures that the line profile from a single spherical crystallite 
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is finite for s = 0. 

The evaluation of the scattering kernel, K, is the matrix product of the instrument 
kernel (mapped into s-space), R, and the line profile kernel, (|33j) . Using H1U|) 

K( Si , Dj) = 5D 5s' K s i ~ 4) D j) ( 34a ) 

f 

Kij = 5D5s ^ R ir I pjlj (34b) 

r 

K = 5D5s'-RI p (34c) 

where R has been mapped into s-space, 5s' is the step size in s'-space and approximates the 
integration in (|1U[). while 5D is the step size in D-space and approximates the integration 
in Q. Care must be taken in selecting 5D to avoid the under-sampling of (|33|) . 

4.2 Applying the fuzzy pixel/MaxEnt method for f(29) 

This approach involves determining the specimen profile from the simulated data. It is 
equivalent to solving the deconvolution problem, (^Q), and is an important first step in 
assessing the nature of the specimen broadening. In the past, we have applied the Skirling 
& Bryan (1984) algorithm with global smoothing (see Sivia 1996), which we refer to here 
as the 'old' MaxEnt method. However, in this section we apply the Fuzzy Pixel/MaxEnt 
method discussed in § 13 .7[ to determine f(26). The results are also compared with those 
from the 'old' MaxEnt method, and their reliability in reproducing the log-normal param- 
eters for the crystallite-size distribution (set in § I4.1j) are assessed. 

The specimen line profiles from the 'old' MaxEnt approach are given in Fig. El These 
results were compared with the theoretical specimen profiles by evaluating the Rf and 
R w values. A summary of these and subsequent analyses is given in Table ^ 

The 'old' MaxEnt method is not based on a Bayesian formalism (see Gull 1989, Skirling 
1989a) and spurious oscillations can appear in the solution specimen profile. This second 
point becomes important in analyzing high angle/low intensity profiles. This is further 
illustrated by inspecting the residuals in Fig. E^b), where the amplitude of the residuals 
is large in comparison with the normalized peak height. We contrast the results in Fig. El 
with the fuzzy pixel/MaxEnt method discussed in § 13.71 Using this theory, the fuzzy pixel 
distribution specimen profiles are shown in Fig. 0J The fuzzy pixel distribution determines 
the optimum fuzzy pixel width, uj (see (|27j) ). For the 200 line, the optimum value was 
found to be Cj ~ 0.07 °29 and for the 400 line, Cj ~ 0.05 °29. This defines the correlation- 
length scale of the noise in the simulated data and essentially filters out the noise effects. 
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Figure 3: 'Old' MaxEnt specimen profiles for the 200 and 400 line profiles, (a) The theoretical 200 specimen 
profile (solid line), 'old' MaxEnt specimen profile (long dashed line + error bars) and the residuals (short dashed 
line), (b) Corresponding results for the 400 line profile, as shown in (a). The horizontal error bars in (a) & 
(b) represent non-overlapping region of interest, while the vertical error bars represent the uncertainty in the 
averaged integrated flux over the region of interest. 

It is evident from the residuals of the multiple orders that smoothing of the specimen 
profile has been achieved using this approach. 

Using the line profiles determined and assuming a spherical crystallite shape, the 
parameters of the underlying log-normal size distribution can be reproduced by following 
the approach of Krill & Birringer (1998). These results are shown in Tabled The analysis 
has produced mixed results, due to the stringent but realistic conditions imposed on the 
background estimation. Comparing the 200 line profile results for the 'old' MaxEnt and 
fuzzy pixels methods, there is a noticeable improvement in the latter results over the 
former. This is not only seen in an improved Rf value, but also in the reproduced log- 
normal parameters. In the case of the 400 line, we notice that the Rf value has improved 
by a factor of ~ 3 and the volume- weighted size by a factor of ~ 1.5 for the fuzzy 
pixel/MaxEnt approach. However, the area-weighted size for the 400 line profile has not 
improved. As a consequence, when the underlying log-normal parameters are determined 
from the area- and volume-weighted sizes no improvements are gained. 

These results for the 400 line profile can be explained by the low peak-to-background 
ratio, statistical uncertainty, and the presence of systematic errors arising from the back- 
ground estimation. The peak-to-background ratio for the 400 line profile is 2.4. This low 
value results in an increased uncertainty in the estimated background level. From (J32[). 
we notice that as the peak-to-background ratio increases, the peak height uncertainty 
decreases and the dominant source of uncertainty becomes the statistical noise. 2 The 
2 Taking (|32fl . we see that in the limit of R p b — > 1, <J V — > oo. On the other hand, in the limit of R p i, — > oo, 
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Figure 4: The fuzzy pixel distribution and MaxEnt solutions for the 200 and 400 line profiles, (a) The 
logPr(w| g, m, er, I) distribution used to determine the optimum fuzzy pixel width, Co w 0.07 °29 for the 200 
specimen proifle. (b) Theoretical specimen profile (solid line), fuzzy pixel/MaxEnt specimen profile (long dashed 
line + error bars) and the residuals (short dashed line) for the 200 line profile, (c) The logPr(w| g, m, a, I) 
distribution used to determine the optimum fuzzy pixel width, u> rts 0.05 °29 for the 400 specimen proifle. (d) 
Theoretical specimen profile (solid line), fuzzy pixel/MaxEnt specimen profile (long dashed line + error bars) 
and the residuals (short dashed line) for the 400 line profile. The horizontal error bars in (a) & (c) represent 
the non-overlapping region of interest, while the vertical error bar represents the uncertainty in the averaged 
integrated flux over the region of interest. 

variance of the observed profile is determined by two components: the Poisson count- 
ing statistics, which can be approximated as ^g, for g >> 10 counts, and the estimated 
background level, b es t; it can be expressed as a 2 = g + b est . The presence of statistical 
uncertainty and the low peak-to-background ratio introduces uncertainties to the slope 
and intercept of the estimated background level. In turn, this introduces systematic errors 
to the Fourier coefficients of / (Armstrong 1999). Although the fuzzy/pixel method has 
been successful in improving the quality of the line profile (which amounts to reducing 
the statistical error in the solution line profile), the systematic errors have propagated 
to the Fourier coefficients of the specimen profile and in turn to the area-weighted size. 
Additional calculations and applying the above analysis to simulated data with zero back- 
up — » ^ I \J lmax.bg- For example, with R ~ 15, a p — 1.2/ y/I max ,bg- 
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Results 


'old' MaxEnt 


Fuzzy Pixel/MaxEnt 


200 


400 


200 


400 


R f (%) 


4.2 


10.9 


2.7 


3.1 


Rw (%) 


2.9 


3.7 


3.0 


3.7 


(t) a (nm) 


19.9 ±0.1 (13.3%) 


20.4 ±0.1 (16.0%) 


17.89 ±0.07 (1.9%) 


20.5 ±0.1 (16.5%) 


(£>>a(nm) 


29.8 ±0.2 


30.5 ±0.2 


26.8 ±0.1 


30.7 ±0.2 


(t) v (nm) 


26.63 ±0.07 (1.7%) 


28.0 ±0.2 (7.1%) 


25.86 ±0.04 (1.2%) 


27.4 ±0.2 (4.8%) 


(D) v (nm) 


35.51 ±0.09 


37.4 ±0.2 


34.48 ±0.05 


36.6 ±0.3 


D (nm) 


19.3 ±0.4 (48.1%) 


18.4 ±0.5 (41.5%) 


14.3 ±0.2 (10.1%) 


19.8 ±0.6 (52.0%) 




1.52 ±0.01 (10.7%) 


1.57 ±0.02 (7.8%) 


1.650 ±0.007 (3.0%) 


1.52 ± 0.02 (10.6%) 




21.06 ±0.43 (40.4%) 


20.4 ±0.5 (36.0%) 


16.3 ±0.2 (8.4%) 


2.6 ±0.6 (44.1%) 


f 7^ ) (nm 2 ) 


84 ± 5(15.3%) 


93 ± 7(27.2%) 


75 ± 3 (2.9%) 


89 ±8 (22.4%) 



Table 1: Area- and volume- weighted sizes for the 200 and 400 specimen line profiles (/) from the 'old' MaxEnt 
and fuzzy pixel/MaxEnt methods. (D) a and (D) v values were determined using (|29p. The Dq, ao, (D) and rf D ) 
values were determined from l|28|l . The percentage differences between the calculated and theoretical values are 
given in parentheses. 



ground (i.e. only Poisson noise) show percentage differences between the calculated and 
theoretical results of < 5% for both the 200 and 400 fuzzy pixel/MaxEnt specimen pro- 
files. This highlights the difficulty of analyzing high- angle/ weak line profiles, which clearly 
requires a good understanding of the background level in order to reduce the influence of 
systematic errors. 

The application of the fuzzy pixel/MaxEnt method for determining f(26) enables the 
specimen broadening to be assessed. This is important in the application of methods such 
as those of Warren-Averbach and Williamson-Hall. Furthermore, the analysis discussed 
here can be used as the a priori information of the Bayesian/MaxEnt analysis. The 
fuzzy pixel/MaxEnt approach overcomes the difficulties in commonly used deconvolution 
techniques (see Armstrong & Kalceff 1998) and resolves the 'ringing effects' in Armstrong 
& Kalceff (1999). 



4.3 Bayesian/MaxEnt method for P(D) using different m{D) 

The next stage in the analysis of the simulated data is applying the Bayesian/MaxEnt 
method to determine the particle distribution, P{D). In addition, two different approaches 
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for determining a model, m(D), were explored and their effects on P(D) were quantified. 
The two approaches were (i) uniform model over D £ [0, 60] nm and (ii) 'low resolution' 
approach (Armstrong 1999) using the log-normal distribution parameters determined in 
§ 14.21 as the prior. 

Uniform model The Fourier coefficients A(t) of the fuzzy pixel/MaxEnt specimen 
profiles (not shown here) suggest the maximum size of the crystallites is ~ 60 nm, since 
A(t) ~ at this length. Using this information, a uniform distribution was defined over 
D £ [0, 60] nm. The corresponding Bayesian/MaxEnt results are shown in Fig. [5J The 
posterior distribution for a is shown in Figs. Efa) & (c) for the 200 and 400 profiles, 
respectively. This distribution was used to average over the set of solutions {P} for each 
case. The Bayesian/MaxEnt results are given in Fig. Efb) & (d) for the 200 and 400 
profiles respectively. 

Using a uniform model, the Bayesian/MaxEnt size distributions where compared with 
the theoretical size distribution, P{D). The Bayesian/MaxEnt results share 'global' fea- 
tures with the theoretical size distributions. However, 'local' features are poorly defined, 
especially in the region of < D < 10 nm. This is a direct consequence of the uniform 
model and the lack of relevant information in the data; that is, it assigns an equal weight 
to all sizes over D. The vertical error bars in both cases correctly represent the misfitting 
between the theoretical and Bayesian/MaxEnt size distributions; additionally, their mag- 
nitude also signifies that a uniform model transfers little or no useful information. This 
can also be seen in the parameters for the Bayesian/MaxEnt distribution compared with 
their theoretical values in Table 121 In determining the log-normal parameters from the 
Bayesian/MaxEnt P(D), the fitted distribution produce reasonable results. This suggests 
that, although the a proiri model is uniform, the Bayesian/MaxEnt method can 'extract' 
some information concerning the underlying distribution from the simulated data. 

'Low resolution' approach A log-normal a priori model used in the Bayesian/MaxEnt 
method was defined from the -Do and o"o of the 200 fuzzy pixel/MaxEnt line profile (see Ta- 
ble Unlike the uniform model, this model defines local features of the size-distribution. 
The Bayesian/MaxEnt results using this model are shown in Fig. Eland the determined 
parameters in Table |2J Before discussing the results, it is interesting to point out that 
the log-normal model and theoretical size-distribution produce a difference of 15.8%. One 
of the aims of this section is to assess whether this difference has been imparted to the 
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Figure 5: Bayesian/MaxEnt crystallite size distributions using a uniform a priori model. (a) The 
Pr(a| g, m, a, X) distribution, 117(1 . used to average over the set of solutions, {P(a)}. (b) The theoretical 
crystallite size distribution (solid line), Bayesian/MaxEnt size distribution (long dashed line + error bars), and 
the uniform a priori model (short dashed line), (c) & (d) The corresponding Pr(a| g, m, cr, I) distribution and 
Bayesian/MaxEnt size distribution for the 400 line profile. 



Results 


Uniform 


model 


'Low' Res. 


Model 


200 


400 


200 


400 


R f (%) 


23.0 


40.0 


22.2 


19.1 


D (nm) 


13.9 ±0.3 (6.5%) 


11.9 ±0.9 (8.8%) 


14.8 ±0.2 (13.4%) 


12.5 ±0.2 (4.4%) 




1.589 ±0.003 (6.5%) 


2.14 ±0.03, (25.8%) 


1.612 ± 0.002 (5.1%) 


1.544 ± 0.002 (9.2%) 


(D) (nm) 


15.5 ±0.3 (3.0%) 


15 ±2 (5.8%) 


16.6 ±0.2 (10.4%) 


13.7 ±0.2 (8.7%) 


°\d) ( nm2 ) 


57 ±3 (22.0%) 


197 ± 145 (> 100%) 


70 ± 2 (3.9%) 


39 ± 1 (46.7%) 



Table 2: P{D) results from the Bayesian/MaxEnt method for the 200 and 400 line profiles using different a 

i 

(D) 



priori models. The values for Dq, ao, (D) and cr?„> were determined by fitting the Bayesian/MaxEnt solutions 



with a log-normal distribution. The percentage difference between calculated and theoretical values are given 
in parentheses. 



Bayesian/MaxEnt size-distribution. 

Comparing the a posteriori distribution for a using a uniform model (see Figs. [5] (a) 
<fe (c)) with that of the log-normal distribution, given in Figs. Efa) & (c), we notice that 
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the effect of the log- normal model is to shift the distribution in a-space and widen it. 
Essentially the solution space parameterized by a has been expanded to encompass those 
solutions which correspond to the available a priori and experimental data. 

The Bayesian/MaxEnt size distributions, given in Figs.HJb) & (d), compare reasonably 
well with the theoretical distribution. However, there is noticeable misfitting between 
these distributions. Further, the Bayesian/MaxEnt solution has been shifted slightly 
relative to the log-normal model. This is also evident in the Rf for the 200 and 400 size 
distributions, given in Table El The Rf for both solutions has increased relative to the 
log-normal model by an additional ~ 3 — 6%. This can also be seen by comparing the 
percentage differences for the Dq, <to, (D) and a 2 D ^ parameters for the 200 fuzzy pixel 
solution, given in Table ^ (i-e. third column), with those given in Table El using the 'low 
resolution' method, where there is a slight increase in the percentage difference, with the 
exception of the value. Additional calculations suggest that misfitting between the 
solution and theoretical size distributions arises from errors in the a prior model. The 
influence of the background estimation which was problematic in the fuzzy pixel analysis 
does not seem to be a factor in this analysis. 

While there exists some misfitting between the solution and theoretical size distribu- 
tions, the vertical error bars correctly account for this misfitting. This characteristic of 
the Bayesian/MaxEnt can be seen for both the uniform and non-uniform models. In- 
deed, this feature of the method ensures that it is fully quantitative, and represents a 
clear strength over existing methods. Comparing these solutions with those using a uni- 
form model, considerable improvement in the size distribution has been achieved. The 
'local' information defined in the log-normal a priori model has been imparted to the 
Bayesian/MaxEnt solution. 

This analysis also demonstrates the difficulty in estimating a suitable non-uniform 
model based on the current techniques. Further, any uncertainty in the model parameters 
is also passed on to the solution distribution. This indicates the need to quantify the 
uncertainty in the model parameters and quantify how these uncertainties are passed on 
to the solution size distribution. 
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Figure 6: Bayesian/MaxEnt crystallite size distributions using a log-normal a priori model, (a) The 
Pr(a| g, m, a, I) distribution, 1)17(1 . used to average over the set of solutions, {P(a)}. (b) The theoretical 
crystallite size distribution (solid line), Bayesian/MaxEnt size distribution (long dashed line + error bars) and 
the low-resolution a priori model (short dashed line), (c) & (d) The corresponding Pr(a| g, m, cr, I) distribution 
and Bayesian/MaxEnt size distribution for the 400 line profile. 

5 Experimental Details 

Analysis of the simulated data highlighted difficulties of background estimation and the 
effect of the a priori model on the Bayesian/MaxEnt size distribution. However, this 
analysis provided a useful understanding of the experimental condition which were used 
in conducting an appropriate set of measurements. The fuzzy pixel/Bayesian/MaxEnt 
methods were applied to experimental Ce02 diffraction data to determine the specimen 
profiles, crystallite shape, and size distribution. These results are compared with trans- 
mission electron microscopy data. 

5.1 XRD Details 

The CeC>2 specimen used here was prepared for the recent CPD and IUCR size round 
robin by Louer & Audebrand (2001). 

Diffraction patterns were collected on a Siemens D500 3 diffractometer equipped with 
3 Certain commercial materials, equipment and software are identified in order to adequately specify the 
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a focusing Ge incident beam monochromator, sample spinner and a scintillation detector. 
Copper K Q i radiation with a wavelength A = 0.15405945 nm was used. The divergence 
slit was 0.67°, while the receiving optics included a slit of 0.05° and 2° Soller slits. Data 
were collected in discrete regions straddling the maxima of each profile, with the step and 
scan width of each region being varied in correspondence with the FWHM. Count times 
were varied so as to obtain an approximately constant total number of counts for each 
scan region. The instrument profile function was determined using a split-Pearson VII 
profile shape function fitted to 22 reflections collected from SRM 660a (LaBg). Fig. 
shows the FWHMs and exponents for the split-Pearson VII profile function. The low- 
and high-FWHMs were fitted using (Cheary & Cline 1995) 



FWHM 2 = A tan 2 9 + B cot 2 9 + C tan 9 + D, 
while the low- and high-exponents were fitted using a fifth-order polynomial. 
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Figure 7: Calibration plots for the split-Pearson VII line profiles, generated from the SRM 660a (LaB@) 
diffraction pattern and used to model the instrument function, K(28). (a) The FWHM vs 28 for low- (+ & 
solid line) and high- (x & dashed line) angle sides of the peak, (b) The exponents vs 28 for low- (+ & solid 
line) and high- (x & dashed line) angle sides of the peak. 



The count times for the Ce02 data were optimized using (|32j) so that the percentage 
error was kept in the range 1-3% for all peaks in the Ce02 pattern. The scan ranges for 
the Ce02 data were considerably wider, in proportion to the FWHM, than those used for 
the data collection from SRM 660a. This ensured a reasonable determination of the tails 
of the profiles and background levels. The Ce02 200 line profile is shown in Fig. E(a). 
This illustrates a typical experimental line profile using the above conditions and settings. 
The estimated (linear) background level is also shown. A log plot of the 200 line before 



experimental procedure. Such identification does not imply a recommendation or endorsement by NIST, nor 
does it imply that the materials or equipment or software are necessarily the best available for the purpose. 
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and after the background estimation is shown in Fig. IHJb) . The procedure for determining 
the background level is as described in § 14.11 




32 33 34 35 J 32 33 34 35 

Position °26> Position °26» 



Figure 8: Ce0 2 e Experimental 200 profile, g{26): (a) The observed 200 profile (solid line) and estimated 
background level (dashed line) over (2#o ± 2) °26, the range over which the analysis was carried out. (b) 
Logarithm of the 200 measured profile before (solid line) and after the background estimation (dashed line). 



5.2 TEM Details 

Particle agglomerates were gently crushed in ethanol using a mortar and pestle. A portion 
of the dilute slurry was dispersed on a holey carbon film and left to dry. Once in the TEM, 
a series of micrographs of particles were taken at a fixed magnification of 200k x. In the 
preliminary examination reported here, these negatives were scanned and analysed by 
manually approximating the particle size with an oval. The oval's major and minor axes 
were adjusted so as to tangentially intersect the particle surface facets. 

There are several sources of error in the measurements: TEMs typically have a 5% error 
in length scale measurements; also, imaging the particle clusters means that particles are 
at different heights, which results in Fresnel fringes around the particles making it harder 
to identify particle edges. Further, larger particles give better contrast and it is easier to 
detect their edges, so it is possible to inadvertently preferentially choose larger particles 
over smaller ones. 

A frequency histogram for about 850 particles is shown in Fig. 112( d) . This figure also 
shows the Bayesian/MaxEnt size distributions (discussed in § I6.2|) determined from the 
non-overlapped hkl profiles of the Ce02 diffraction pattern. A TEM micrograph of the 
Ce02 crystallites is shown in Fig. 1131 It can be seen from the larger particles that they 
have a spherical-like morphology. 
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6 Analysis of CeC>2 x-ray diffraction data 

Two levels of application of the Bayesian and MaxEnt theory has been chosen in our 
analysis. We refer to these as the qualitative and quantitative approaches, to reflect their 
degree of rigor (see ? l6.1l fc KT!fl respectively). 

6.1 Qualitative analysis 

The qualitative analysis is used to determine the type and nature of specimen broadening, 
by first determining the specimen profile, /, followed by the application of the Warren- 
Averbach and Williamson-Hall methods. The integral breadths, from a Williamson-Hall 
plot, identify the presence of both strain- and size-broadening contributions, while plotting 
multiple-order Fourier coefficients and all other available Fourier coefficients on the same 
axes, also allows size- and strain-broadening contributions be identified (see Armstrong 
1999). 

We have introduced the fuzzy pixel/MaxEnt method for determining / to ensure that 
no artifacts (such as spurious oscillations in the tails of /) are promulgated to the solution, 
and also to preserve the positivity of /. 

We stress that unlike traditional methods, the approach in this section makes no as- 
sumptions at all about the nature of the specimen profile or broadening (i.e be it Gaussian, 
Lorentzian, Voigtian etc.). Thus, in further distinction from traditional deconvolution 
approaches, our approach facilitates the subsequent unbiased assessment of anisotropic 
broadening in the specimen, for example using contrast factors (Ungar & Tichy 1999). 

Fig. El shows an example of the fuzzy pixel/MaxEnt method applied to the Ce02 
measured 200 line profile given in Fig. |H1 Fig. IH^a) is an example of the 'old' MaxEnt 
method, showing the effect of noise amplification. On applying the fuzzy pixel/MaxEnt 
method, the correlation length scale for the profile was determined, as discussed in § 13.71 
and is shown in Fig.|H{b); the subsequent / and Fourier coefficients for the 200 line profile 
are given in Figs.EJc) & (d), respectively. As demonstrated in the analysis of the simulated 
data, there is noticeable improvement in the quality of the solution line profile using the 
fuzzy pixel/MaxEnt method. This approach was applied to all the non-overlapped line 
profiles, including 111, 200, 220, 400, 422, 511 and 531. 

The volume- and area-weighted sizes were determined from the Williamson-Hall plot 
and Fourier coefficients, respectively. These results are shown in Fig. and summarized 
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Figure 9: Specimen profiles from the 'old' MaxEnt method and fuzzy pixel/MaxEnt method for the measured 
CeC>2 200 line, (a) 'Old MaxEnt' specimen profile (solid line + error bars), (b) logPr(w| g, m, a, X) distribution 
to determine the optimum fuzzy pixel width, Co 0.07 °29. (c) Fuzzy Pixel/MaxEnt specimen profile (solid line 
+ error bars), (d) Fuzzy Pixel/MaxEnt Fourier coefficients (solid line). 
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Figure 10: Volume- and area-weighted sizes from the Fuzzy Pixel/MaxEnt line profiles for measured Ce02 
data, (a) Classical Williamson-Hall plot of the integral breadths, showing no dependency on hkl. This suggests 
that the crystallites are spherical in shape, (b) Area-weighted sizes determined from the Fourier coefficients of 
the Fuzzy Pixel/MaxEnt line profiles. 



Fig. llOf a) shows the Williamson-Hall plot for the non-overlapped line profiles. It is 
evident that size effects are the dominant source of specimen broadening, since there 
is no detectable slope in the integral breadth data. Moreover, there is no systematic 
variation of the integral breadths with hkl, further suggesting that the crystallite shape 
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hkl 


<*>a 


(D)a 


(t)v 


(D)v 


Do 


c 


(D) 


°\d) 




(nm) 


(nm) 


(nm) 


(nm) 


(nm) 




(nm) 


(nm 2 ) 


111 


19.21 ± 0.05 


28.81 ± 0.07 


22.88 ± 0.03 


30.50 ± 0.04 


25.0 ± 0.2 


1.270 ± 0.007 


25.7 ± 0.2 


39 ± 2 


200 


16.04 ± 0.06 


24.06 ± 0.08 


22.22 ± 0.05 


29.63 ± 0.06 


14.3 ± 0.2 


1.578 ± 0.007 


15.9 ± 0.2 


58 ± 2 


220 


18.92 ± 0.04 


28.38 ± 0.06 


23.24 ± 0.04 


31.00 ± 0.05 


22.8 ± 0.2 


1.345 ± 0.006 


23.8 ± 0.2 


52 ± 2 


400 


15.76 ± 0.06 


23.64 ± 0.09 


22.03 ± 0.11 


29.4 ± 0.2 


13.7 ± 0.3 


1.59 ± 0.01 


15.3 ± 0.3 


56 ± 3 


422 


15.45 ± 0.08 


23.2 ± 0.1 


21.5 ±0.1 


28.6 ± 0.1 


13.7 ± 0.3 


1.58 ± 0.01 


15.2 ± 0.3 


54 ± 3 


511 


15.91 ± 0.07 


23.9 ± 0.1 


21.9 ±0.1 


29.2 ± 0.2 


14.4 ± 0.3 


1.57 ± 0.01 


16.0 ± 0.3 


57 ± 3 


531 


15.04 ± 0.04 


22.55 ± 0.07 


21.9 ± 0.1 


29.20 ± 0.16 


11.8 ± 0.2 


1.66 ± 0.01 


13.5 ± 0.2 


53 ± 3 


Average 


16.6 ± 0.2 


24.9 ± 0.2 


22.2 ± 0.2 


29.6 ± 0.3 


16.5 ± 0.6 


1.51 ± 0.03 


17.9 ± 0.7 


53 ± 7 



Table 3: Summary of CeC>2 data analysis. The area- and volume- weighted sizes were determined from the 
specimen profile of the fuzzy pixel/MaxEnt method. The (t) a and (t) v results were determined directly from / 
using (|30() and (|31|l . respectively. The area- and volume- weighted diameters were determined using l|29(l . while 
the log-normal parameters were determined from Krill & Birringer (1998) and using Ij28(l . 

is independent of hkl. From these results, we can infer that the average shape of the 
crystallites is spherical. This is further supported by the area-weighted sizes shown in 
Fig. HUf bb These results were determined by applying Q3U|) to the Fourier coefficients of 
the fuzzy pixel/MaxEnt specimen profiles and plotted over the entire 2#-range. Again, the 
relative uniformity of this plot suggests that size effects are the major source of specimen 
broadening and that crystallites are near-spherical in shape. Deviations for the 111 and 
220 data points in Fig. HOf b) arise from the differentiation of (|3()|) in the region t — > 0, 
where perturbations in the Fourier coefficients cause large changes in the area-weighted 
size (Armstrong 1999). In addition, the Fourier coefficients for all the non-overlapped hkl 
lines suggest that the maximum crystallite size is ~ 50 — 60 nm. An example of this can 
be seen in Fig. Efd), where A(t) ~ for ~ 50 — 60 nm. This can also be seen from the 
discussion in § 12.21 and by inspecting Fig. ^ where the boundary conditions for A(t) (or 
V(t)) are defined in terms of the maximum size in the direction of the scattering vector. 

Referring to Table 01 a spherical crystallite shape model was used to determine the 
area- and volume- weighted diameters, together with ()29aj) and (|29b|) . respectively. The 
log-normal distribution parameters, Dq, o~q, (D) and were determined using the 

equations developed by Krill & Birringer (1998) and (|28|) . which relate the log- normal 
parameters to the area- and volume- weighted sizes and the average diameter, (D), and 
variance o-(d)- 

It can be seen from Fig. ED and Table 01 that the area- and volume- weighted sizes are 
relatively uniform for the 29 (or hkl) range. The quoted uncertainty for the averages was 
determined from a sum of squares of the uncertainties in the tabulated results. 

The average results for Dq and o~q, were used to define a log-normal a priori model 
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in the Bayesian/MaxEnt method (see § I6.2|) . By defining the a priori model as a log- 
normal distribution, we are essentially testing the assumption that the size distribution 
is a log- normal distribution. 

If the underlying size distribution is indeed log-normal, with parameters close to those 
in Table El then we would expect the Bayesian/MaxEnt solution to lie 'close' to the a 
priori model. However, if the Bayesian/MaxEnt solution were 'some distance' from the 
a priori model, this would imply that either the underlying parameters or the model 
were inappropriately defined. The former case was demonstrated in analysis of the simu- 
lated data (see § 14.3)1 , where uncertainties in the log-normal model were passed onto the 
Bayesian/MaxEnt solution; the latter case requires additional Bayesian analysis to test 
possible models (Sivia et al. 1993, Loredo & Lamb 2001). 

In summary, the qualitative analysis has applied the fuzzy pixel/MaxEnt method to de- 
termine the specimen profile / for all non-overlapped line profiles from the CeC>2 measured 
data (see Fig- EJ) • This enabled subsequent analyses to determine the Fourier coefficients, 
integral breadths, and the area- and volume-weighted sizes. Fig. EH and Table El clearly 
indicate that the Ce02 specimen on average consists of spherical crystallites. While a 
log-normal distribution can be fitted to these results, a quantitative method such as the 
Bayesian/MaxEnt technique is needed to determine the Ce02 size distribution directly 
from the experimental data and to verify the assumption of a log-normal model. 

6.2 Quantitative analysis 

The quantitative analysis method uses the a priori information determined from the 
qualitative analysis and the available experimental data (such as the instrument and profile 
kernels, statistical uncertainties and experimental line profiles) to directly determine the 
crystallite size distribution. 

The MaxEnt method also enables an a priori model to be included, while quantifying 
the uncertainty in the solution size distribution. 

In this section, we apply the Bayesian/MaxEnt method to the Ce02 data. The anal- 
ysis presented here follows the steps discussed in § 14.31 Two a priori models are used: 
- (i) a uniform model, and (ii) the log-normal distribution determined in § 16.11 The 
Bayesian/MaxEnt size distributions for each case are fitted with a log-normal distribu- 
tion, while the size distributions from (ii) are compared with the TEM size distribution, 
with very good agreement. 
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Uniform model A uniform model was denned over the region of D € [0, 60] nm 
determined by the Fourier coefficients of the specimen profile, where A(t) ~ 0. This is 
illustrated by the Fourier coefficients for the 200 line profile, given in Fig. ESd). The 
Bayesian/MaxEnt size distributions using this model are shown in Fig.^2f° r the 200 line 
profile (see Fig. ^2 (a & b)). The size distributions for the non-overlapped line profiles 
are given in Fig. ^2 (c & d). 
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Figure 11: CeC>2 Bayesian/MaxEnt crystallite-size distribution using a uniform a priori model over D 6 
[0, 60] nm: (a) The Pr(a| g, m, a, I) distribution, |17jl. used to average over the set of solutions, {P(a)} for 
the 200 line profile; (b) Bayesian/MaxEnt crystallite-size distribution (solid line + error bars) and the a priori 
model over D S [0, 60] nm; (c) CeC>2 Bayesian/MaxEnt crystallite-size distributions for the various CeC>2 hkl 
profiles; (d) Average diameters from the uniform model (+) and log- normal models (x). The horizontal lines 
represent the average for each model. 

The uncertainties in the Bayesian/MaxEnt size distribution for the 200 line profile 
indicate how little useful a priori information has been transferred from the uniform 
model to the final distribution. We also notice that the final distribution is some distance 
from the model, illustrating that the underlying CeC>2 crystallite size distribution consists 
of a non-uniform structure. As can be seen in Fig. Hir e), the size distributions are poorly 
defined in the range of D € [0, 5] nm; while for D > 5nm the non-uniform structure 
is evident. Since the size distribution is the only invariant quantity, we also expect the 
solution for each hkl to be the same. From the size distributions given in Fig. lllf c). there 
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hkl 


#0 

(ran) 




(D) 
(nm) 


(nm 2 ) 


111 


13.1 ±0.4 


1.9 ±0.1 


16.3 ±0.8 


140 ± 43 


200 


14.7 ±0.3 


1.61 ±0.03 


16.4 ±0.4 


69 ±7 


220 


15.5 ±0.4 


1.70 ±0.08 


17.8 ±0.6 


104 ±25 


400 


15.8 ±0.8 


1.7 ±0.1 


18 ± 1 


120 ± 48 


422 


11.1 ±0.2 


1.728 ±0.007 


12.8 ±0.2 


58 ±2 


511 


14.3 ±0.3 


1.66 ±0.05 


16.3 ±0.4 


78 ± 15 


Average 


14 ±1 


1.7 ±0.2 


16 ±2 


95 ±71 



Table 4: Size distribution results using a uniform a priori model in the Bayesian/MaxEnt method. The 
Bayesian/MaxEnt size distributions given in Fig. Hir e) were fitted with a log-normal size distribution and the 
above parameters determined. 

is a broad agreement between the distributions, with the exception of the ill and 422 
cases. Both of these distributions lie at the extremities of the diffraction pattern and are 
more likely to be susceptible to larger experimental uncertainties. 

The Bayesian/MaxEnt size distributions were fitted with a log-normal model and the 
Do, do, (D) and of™ parameters determined. These results are given in Table HJ The 
uncertainties in the solution distributions for the uniform model are also reflected in the 
uncertainties in the fitted quantities. This is especially the case for the variance of the size 
distributions, of™, with an error of ~ 80%. This large uncertainty is a consequence of 
the scatter of size distributions shown in Fig. Illf c). Such scatter is also noticeable when 
the average diameters, (D), (Table 0J) are plotted, as shown in Fig. Illf d). The average 
values for Dq, do, (D) and are again in broad agreement with results determined in 
§ lti.ll once the uncertainties are taken into account. 

In summary, the use of the uniform model in the Bayesian /MaxEnt method has shown 
that there is a non-uniform structure to the Ce02 size distributions. However, the lack 
of information in this model results in large uncertainties and considerable scatter of the 
distributions when plotted on the same axes (see Fig. Hir e)). 

Log-normal model The parameters for the log-normal distribution determined in 
S IG.ll were used as the non-uniform a priori model in the Bayesian/MaxEnt method. The 
model was defined over the range of D G [0, 60] nm. 

The Bayesian/MaxEnt size distributions for this model are shown in Fig. 1121 The 
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results are listed in Table|SJ Figs. ll2f a') & (b) show the results for the 200 size distribution 
using this model. The Bayesian/MaxEnt solution lies close to the log- normal model, 
while the uncertainties have decreased considerably compared with the size distribution 
(using a uniform model) in Fig. Illf b); however, although the vertical error bars have 
decreased, they are still considerable. This can be explained in terms of the influence 
of the peak-to-background ratio. As discussed in § 14.21 the variance of the experimental 
data is determined by two terms, the statistical noise and the variance on the estimated 
background level. If the peak-to-background ratio is large (> 10), then the statistical 
noise dominates and the corresponding error bars in the Bayesian/MaxEnt distribution 
become small when the solution is close to the underlying size distribution. This has been 
demonstrated using computer simulations. However, if the peak-to-background ratio is 
finite (< 10), the corresponding error bars in the MaxEnt/Bayesian solution remain finite 
regardless of how close the solution is to the underlying distribution. This is a direct 
consequence of determining the size distribution directly from the experimental data. 

The Bayesian/MaxEnt size distributions for all the non-overlapped hkl line profiles are 
shown in Fig. I12f c). They lie very close to each other, reflecting the invariance of the size 
distribution and remaining close to the log-normal model. The scatter in the size distri- 
butions that was noticeable in Fig. Hir e) for the uniform model has disappeared. Further, 
these results imply that the underlying size distribution from the CeC>2 crystallites can be 
described by a log-normal distribution. Comparing these results with the TEM size distri- 
bution, very good agreement is obtained for 14 < D < 60 nm. Due to its poor statistics, 
the TEM size distribution is ill-defined for D < 14 nm. As mentioned above, the Ce02 
agglomerates were not separated, making it difficult to identify the smaller crystallites 
and contributing to the poorly defined region for D < 14 nm. The TEM size distribution 
given in Fig. I12f , represents a preliminary set of data and further results are currently 
being collated. 

The correspondence between the Bayesian/MaxEnt size distributions and the TEM 
distribution is very good for D > 14 nm. The size distributions shown in Fig. I12r c) 
were fitted with a log-normal distribution and the Dq, do, (D) and &\p\ parameters were 
determined. These results are shown in Table El The fitted distribution compared very 
closely with the solution distribution. The small uncertainties in the fitted quantities of 
Table El reflect the quality of the Bayesian/MaxEnt distributions. This can also be seen 
in the low uncertainty in the variance, of D \, which is ~ 8%. 
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Figure 12: CeC>2 Bayesian/MaxEnt crystallite-size distribution using a log-normal a priori model over D 6 
[0, 60] nm. (a) Pr(a| g, m, a, I) distribution, 1)17(1 . used to average over the set of solutions {P(a)}, for the 200 
line, (b) Bayesian/MaxEnt crystallite-size distribution (solid line + error bars) and the a priori model (dashed 
line) over D 6 [0, 60] nm. (c) CeC>2 Bayesian/MaxEnt crystallite-size distributions for the non-overlapped line 
profiles, (d) Comparison of TEM and Bayesian/MaxEnt size distributions. Due to difficulties in de-aggregating 
the Ce02 particles and identifying smaller crystallites, the TEM size distribution is poorly defined for D < 14 nm. 
However, for D > 14 nm, the comparison is excellent. 

The average quantities given in Table [3 can be considered to represent the size dis- 
tribution for the Ce02 specimen. Hence, the use of the fuzzy pixel/Bayesian/MaxEnt 
methods has determined the specimen profile, /, and enabled size effects to be identified 
as the major source of specimen broadening. The analysis of the line profiles has shown 
that the crystallite shape is spherical, on average. The Fourier coefficients of the specimen 
profiles also show that the crystallites have a maximum size of ~ 60 nm. This was subse- 
quently shown from the Bayesian/MaxEnt size distributions. Using this information, the 
Bayesian/MaxEnt method successfully determined the CeC>2 size distribution. While the 
size distributions using a uniform a priori model broadly agree with the results from the 
fuzzy pixel analysis, the uncertainty in the results is large; on using a log-normal a priori 
model considerable improvements in the size distribution were obtained. The non-uniform 
structure in the model has been transferred to the Bayesian/MaxEnt solution. 

The TEM micrograph of the Ce02 specimen, shown in Fig. 1131 confirms the results 



Armstrong et al. (zUUl J. naycsian micrcncc ol nanoparticlc- broadened x-ray line prohlcs 



hkl 


Do 
(nm) 


fTn 

u 


(D) 
(nm) 


(nm 2 ) 


111 


15.9 ±0.2 


1.505 ±0.008 


17.2 ±0.2 


54 ±2 


200 


16.64 ±0.04 


1.469 ±0.005 


17.91 ±0.04 


51 ±1 


220 


15.68 ±0.06 


1.502 ±0.008 


17.04 ±0.08 


52 ±2 


400 


15.48 ±0.01 


1.480 ±0.005 


16.72 ±0.03 


46 ± 1 


422 


15.86 ±0.07 


1.4799 ±0.0002 


17.12 ± 0.08 


48.7 ±0.5 


511 


16.21 ±0.07 


1.497 ±0.002 


17.58 ±0.08 


54.7 ±0.6 


531 


16.32 ±0.07 


1.500 ±0.005 


17.72 ±0.09 


56 ± 1 


Average 


16.0 ±0.2 


1.49 ± 0.01 


17.3 ±0.3 


52 ±3 



Table 5: Size distribution results using a log- normal a priori model in the Bayesian/MaxEnt method. The 
Bayesian/MaxEnt size distributions given in Fig. II lf c^l were fitted with a log-normal size distribution and the 
above parameters determined. 

that have been determined from the x-ray diffraction data. From the micrograph, it 
can be seen that the crystallites are near-spherical in shape. It can also be seen that 
the crystallites are in the range of size predicted by crystallite-size analysis. Considerable 
overlapping of the crystallites, which complicates the task of gathering sufficiently reliable 
data for the TEM size distribution is evident. 

7 Conclusion 

The central aim of this study was to develop a single and self-contained method for 
determining the crystallite-size distribution and shape from experimental line profile data. 
We have shown that the crystallite-size distribution can be determined without assuming 
a functional form for the size distribution, determining instead the size distribution with 
the least assumptions. 

This was achieved by reviewing size broadening theory showing how the observed line 
profile can be expressed in terms of the instrument kernel, line profile kernel and size 
distribution. It was also shown that the instrument and line profile kernels could be 
combined into a single kernel, hence enabling the simultaneous removal of instrumental 
broadening while determining the size distribution (see §|2J). 

The development of this method made use of two fundamental observations — that 
distributions such as the specimen profile and size distribution must be both positive 
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Figure 13: A TEM micrograph of the CeC>2 specimen taken at a magnification of 200 kx. The crystallites 
appear to have a spherical-like shape and size that are in the range predicted by the crystallite-size analysis 
presented here. 

and additive. Drawing on extensive theoretical developments, the entropy function was 
selected as the function that can attribute values to the specimen line profile and size dis- 
tribution, while preserving the positivity and additivity of the profile and distribution. It 
can be also argued that the entropy function is the only function that produces consistent 
results in the light of experimental data (see § 13. 2 1) . 

Using the mathematical and statistical foundations of Bayesian theory, the a pos- 
teriori distributions of P(D) in terms of the experimental data, statistical noise and 
scattering kernel can be determined. By maximizing this distribution, the most probable 
size distribution can be calculated from the experimental line profile, without making any 
assumptions concerning the functional form of the size distribution. Determining the most 
probable size distribution addresses the inherent non-uniqueness and ill-conditioning in 
the integral equations arising from scattering and instrumental broadening. The general- 
ity of this formalism enables any crystallite shape to be used and any number of principal 
axes, D = {D\, D2, D3}, of the crystallite shape can be included in determining the 
corresponding size distributions. 

Simulated data were used to test the fuzzy pixel and Bayesian/MaxEnt methods on 
size-broadened line profiles. The reliability of these methods was established by showing 
that they can reproduce the underlying parameters of the area- and volume-weighted 
sizes, and the parameters of the size distributions. 
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The application of these methods to Ce02 experimental data generally produced very 
good results. The line profile analysis applying fuzzy pixel/MaxEnt methods produced 
reliable and consistent results over a wide range of low-, mid- and high- angle profiles. 

The application of the Bayesian/MaxEnt method to the Ce02 data demonstrated 
that this method can determine size distributions, while making the minimum number of 
assumptions. The use of a uniform a priori model produced broadly consistent results 
with the fuzzy pixel/MaxEnt method; however, the lack of information defined in this 
model was evident in the large uncertainties of the estimated quantities. 

Using the fuzzy pixel/MaxEnt results as the log-normal a priori model demonstrated 
that once 'useful' information is encoded in the model, improvements in the size distri- 
butions and considerable reduction in the uncertainties can be achieved. Analysis of the 
x-ray diffraction profiles using the log- normal model in the Bayesian/MaxEnt method re- 
vealed that the crystallites are spherical in shape, with a size distribution corresponding 
to the distribution in Fig. Eland average quantities in Table 03 The comparison of these 
Bayesian/MaxEnt results with TEM results is favorable, but it does reveal shortcom- 
ings in the collected TEM data arising from particle aggregation. The TEM distribution 
micrographs support the results from the line profile analysis. 

The use of simulated and experimental data demonstrates that the fuzzy pixel/Bayesian/ 
MaxEnt methods are fully quantitative in their ability to determine and attribute errors 
to the solution line profiles and size distributions. 

Although the results from the Bayesian/MaxEnt method are in good agreement and 
address the limitations of the earlier work (see Armstrong & Kalceff 1999, Armstrong 
1999), several important issues have been raised and are the subject of further investi- 
gation. These concern the accurate background estimation of the observed line profile 
and are very important; for example, the analysis of simulated data demonstrated how 
systematic errors affect the Fourier coefficients. Recently, David & Sivia (2001) have de- 
veloped a Bayesian technique for estimating the background, which can be adopted in this 
method. Another problem encountered was in the estimation and quantifying of a non- 
uniform a priori model. In this analysis we have used the information determined from 
the fuzzy pixel/MaxEnt method; however, the issue of determining the a priori model 
can also be addressed in a Bayesian context, by using a process of model selection (Sivia 
et al. 1993, Loredo & Lamb 2001) and defining an a posteriori distribution of parameters 
in the model (Jarrell & Gubernatis 1996). Further, only single line profiles were analyzed 
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here; while the formalism has been expressed for overlapped line profiles, demonstrat- 
ing that Bayesian/MaxEnt method is flexible in its application, additional analysis of 
overlapped line profiles is needed and this will be followed in future studies. 

The literature has seen considerable debate over the type of distribution that best 
describes the distribution of sizes (see Blackman et al. 1994, Krill & Birringer 1998, Kiss 
et al. 1999, Langford et al. 2000). In the analysis presented here we have simply used a 
log-normal distribution to demonstrate that the Bayesian/MaxEnt method can reproduce 
the parameters. Moreover, the position we have taken in developing the Bayesian/MaxEnt 
method is that we are not concerned with the type of distribution; rather, we have pro- 
duced a reliable and consistent method that can determine the specimen profile and/or 
the size distribution, given our understanding of the experimental data, statistical noise 
and instrumental effects. 
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